Apparatus, method, and computer program product for synchronizing data sources

ABSTRACT

A method is provided for synchronizing data sources. The method includes receiving at least first and second data tables. The data tables have one or more mutually similar fields and one or more dissimilar fields from one another. First normalized, second normalized, and intermediate data tables are stored, each including respective first normalized, second normalized, and intermediate data table fields that each correspond to the mutually similar fields of the first and second data tables. The first normalized data table is at least partially populated with corresponding entries from the first data table and the second normalized data table is at least partially populated with corresponding entries in the second data table. Intermediate data table entries are respectively replaced with corresponding inconsistent data entries of the first and second normalized data tables. An apparatus and a computer program product for accomplishing the above method are also provided.

BACKGROUND INFORMATION

Address books, including electronic address books, are widely used. Inmany cases, such electronic address books are associated with aparticular application, a particular device or data source, or both.When an individual user maintains multiple address books, it may bedesirable to synchronize the various address books such that eachaddress book contains current information. However, it can be tedious tomanually update each address book with the same information. As such,some methods of synchronizing address books in a somewhat automatedfashion have been developed.

For existing automated synchronization methods, address booksynchronization is usually limited to synchronization between two datasources that have the same data schema (e.g., data sources includingdata tables having the same table fields). However, synchronizing databetween data sources with dissimilar data schema remains a challenge.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S)

FIG. 1 is a schematic view of a system for synchronizing data sourcesconfigured in accordance with an exemplary embodiment.

FIG. 2 is a schematic view of the system of FIG. 1, exemplifying datatables associated with the two data sources.

FIG. 3 is a schematic view of the system of FIG. 1, exemplifying ahardware configuration for the various components.

FIGS. 4 a and 4 b show an embodiment of a data format for data stringsthat may be passed between the data sources and the proxy server.

FIG. 5 is a schematic view of the proxy server and two data sources,showing exemplary data tables contained therein.

FIG. 6 is a flowchart representing a method for synchronizing datasources, the method being in accordance with an exemplary embodiment.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

Exemplary embodiments now will be described hereinafter with referenceto the accompanying drawings, in which exemplary embodiments andexamples are shown. Like numbers refer to like elements throughout.

Referring to FIGS. 1-3, therein is shown a system 100 for facilitatingsynchronization of data tables, such as address books, the system 100being constructed in accordance with an exemplary embodiment. System 100includes several data sources, including a mobile device, such ascellular telephone 102 or mobile data device, and a storage unit, suchas server 104. A processing unit, proxy server 106 in the illustratedembodiment, communicates with cellular telephone 102 and server 104.

Cellular telephone 102 includes a first data table 108, stored, forexample, in a memory 103 or other storage device of cellular telephone102. First data table 108 has one or more fields 110 a-n, some or all ofwhich may be populated with data entries of varying types. In theillustrated embodiment, data entries 112 a-d populate four of the fields110 a-d, while other fields remains empty. In the illustratedembodiment, the data entries 112 a-d are indicative of persons' namesand associated telephone numbers, although other types of data can becontained in the fields 110 a-n, including textual, numeric, and/orsymbolic data indicative of any information, including, for example, anaddress, a fax number, a birthday, an email address, a bank statement, acalendar, etc. First data table 108 can have any number of fields.

Another data table, second data table 114, is associated with, forexample, may be stored in a memory 105 of, server 104. Second data table114 has one or more fields 116 a-n, some or all of which may bepopulated with data entries of varying types. In the illustratedembodiment, data entries 118 a-i populate nine of the fields 116 a-i,while other fields remain empty. In the illustrated embodiment, the dataentries 118 a-i are indicative of a person's name, separated into firstname, middle initial, and last name fields, and associated address andplurality of telephone numbers, although other types of data can becontained in the fields 116 a-n. Second data table 114 can have anynumber of fields.

It is noted that first and second data tables 108, 114 include one ormore mutually similar fields for storing overlapping types of data. Forexample, fields 110 a and 110 c of first data table 108 are fields forstoring name-type data, as are fields 116 a-c of second data table 114.These fields are therefore referred to as being similar, in that theystore overlapping types of data to the extent that there is at leastsome, albeit incomplete, coincidence of the content of the data. Thesame statements apply to fields 110 b, 110 d, and 116 d (and possibly116 e), which are all configured to store telephone number data. Also,it is noted that field 110 a and fields 116 a-c all store nameinformation for “Thomas Smith,” although in second data table 114, thefull name stored is “Thomas A. Smith.” Further, the name information offirst data table 108 is formatted differently than the name informationof second data table 114, in the prior case being contained in a singlefield 110 a and in the latter case being split between three fields 116a-c. As such, fields 110 a and 116 a-c are referred to as related orcorresponding, and data entry 112 a corresponds to, but is notcoincident with, data entries 118 a-c. A key can be associated with eachof the fields of a data table in order to indicate correspondence ofdifferent fields. In one embodiment, the name field can act as the key,such that all data entries can be associated with a name entry, althoughother keys, such as a numerical indicator associated with all relatedfields, can also be used.

One or both of the first and second data tables 108, 114 may also haveone or more dissimilar fields for storing types of data that areexclusive to the respective table. For example, in the illustratedembodiment, second data table 114 includes data related to an address infields 116 f-i, while first data table 108 is not configured to storeaddress-type data. In this case, the address fields 116 f-i are referredto as dissimilar fields, associated only with the second data table 114and having no counterpart fields from the first data table 108configured to store that type of information. In this illustratedembodiment, field 116 e also is a dissimilar field, in that it pertainsto a second telephone number (for example, a facsimile number) to beassociated with a person's name, while the first data table 108 lackssuch a field.

Referring to FIGS. 1-3, 4 a, and 4 b, proxy server 106 can be configuredto identify corresponding inconsistent data entries of correspondingfields of the first and second data tables 108, 114. As used herein,corresponding data entries are those entries in different data tablesthat refer to the same substantive data, e.g., the data entries of twodifferent data tables that each refer to the same home telephone numberfor the same person. For example, proxy server 106 may include acommunications device 121, such as a network connector and/or wirelessmodem, a processing device 107, and a memory 120 or other storagedevice, all in communication with each other. The communications device121 may facilitate receipt of first and second data tables 108, 114 forstorage in memory 120. Processing device 107 may then compare the dataentries of fields that are determined to be corresponding. For example,each field may be represented by a data string 190 including a key 192(in this case, “TSmith”), a data-type identifier 194 (here, “PhHm” forhome telephone number), and substantive data 196 (e.g., 1234567890),each separated by a character separator, “|”. The key 192 indicates thegroup to which the data string 190 belongs (e.g., for address book data,the key 192 may refer to a specific person or contact), and thedata-type identifier indicates the type of data (e.g., home telephonenumber) involved. Processing device 107 may use key 192 and data-typeidentifier 194 to ascertain correspondence information for the datastring 190 representing the field, that is, to identify other dataentries in the same or different data table(s) that refer to the samesubstantive data. Other formats for data string 190 are also possible.

As an example, processing device 107 might compare the entry 112 a offield 110 a to the data entries 118 a-c of fields 116 a-c. This could bedone, for example, by concatenating the entries 118 a-c and thencomparing the aggregated entry to entry 110 a. Other methods ofcomparison are also possible. Again referring to the illustratedembodiment, processing device 107 might compare the telephone numberdata 112 b, 118 d in fields 110 b, 116 d, thereby determining that theentries are not consistent. Processing device 107 can perform the abovedescribed tasks, for example, under control of, i.e., executing,software stored in memory 120, or in other ways.

In response to the above comparison, proxy server 106 may generate adissimilarity indicator for communication to cellular telephone 102and/or server 104. This dissimilarity indicator, for example, can be asignal describing the differences between the compared correspondingentries of corresponding fields. Referring to the above example,comparison of data entries 112 b and 118 d might yield a signal, fortransmission to cellular telephone 102, indicating that the last “4” hasbeen changed to a “5” (or a signal indicating just the opposite fortransmission to server 104). The signal may allow modification of thefirst and/or second data tables 108, 114 so as to synchronize, e.g., setto equal values, the corresponding fields of the first and second datatables 108, 114. For example, the signal may include one or moreinstructions governing modification of the first and/or second tables bythe cellular telephone 102 and/or server 104, respectively, or may allowmodification by the processing device 107 and subsequent transmission ofthe modified entry to the cellular telephone 102 and/or server 104 viathe communications device 121 for replacing entries stored locally.Referring to the illustrated embodiment, the processing device 107 mayproduce and/or execute an instruction that that causes the telephonenumber 118 d stored in field 116 d to be changed from its present valueto the value stored in field 110 b.

Server 104 may communicate with one or more user interfaces, such as avoice portal 122 to be accessed from a telephone, or a user inputdevice, such as a keyboard 124, and a display device 126. These userinterfaces may allow a user to manually alter the data entries in thesecond data table 114, as well as to view the entries of the second datatable. Similarly, cellular phone 102 may include or otherwise beassociated with one or more user interfaces, such as a keypad 128 and/oran LCD display screen 130, that allow interaction with the first datatable 108.

In some embodiments, the proxy server 106 determines a priority betweenthe corresponding inconsistent data entries, for example, via processingdevice 107. This determination of priority allows the processing device107 to determine which of two inconsistent data entries fromcorresponding fields should be replaced. Referring to the illustratedembodiment, field 110 b of first data table 108 includes telephonenumber “321 555 1234” (data entry 112 b), while field 116 d of seconddata table 114 includes telephone number “321 555 1235” (data entry 118d). In the present example, these data entries both refer to a telephonenumber for the same person, and they are therefore corresponding dataentries. However, the data entries are not coincident, and they aretherefore corresponding inconsistent data entries. If the processingdevice 107 determines that the data entry 118 d of field 116 d has ahigher priority, then the data entry 118 d will be used to replace thedata entry 112 b, such that field 110 b will include the telephonenumber “321 555 1235.” As such, the fields 110 b and 116 d areultimately synchronized. Examples of how priority might be determinedare provided below.

The above priority determinations can be made independently for each setof corresponding inconsistent data entries. For example, looking at therespective telephone number fields 110 b, 116 d and respective namefields 110 a, 116 a-c of the first and second data tables 108, 114, ahigher priority may be given to the telephone number in field 110 b offirst data table 108, such that data entry in field 116 d is replacedwith the telephone number in field 110 b. Simultaneously, a higherpriority may be given to the name data in fields 116 a-c of second datatable 114, such that these data from second data table 114 are used toreplace the name entry in first data table 108. This process may berepeated for some or all of the fields of the first and second datatables.

Referring to FIGS. 5 and 6, therein is shown a schematic representationof the operation of a system 200 for synchronizing data tables, thesystem including a proxy server 206 that employs a method 300 ofdetermining priority between corresponding inconsistent entries of firstand second data tables 208, 214 respectively associated with acommunications device, such as cellular telephone 202, and a storageunit, such as server 204. Raw device data table 208 and raw server datatable 214 could be, for example, electronic address books maintained inthe cellular telephone 202 and server 204, respectively. Entries in rawtables 208, 214 may be in any format, the format being dictated, in someembodiments, by the type of device within which each table is stored.

At Block 302, first and second data tables 208, 214 are received atproxy server 206 and stored as raw device data table 240 and raw serverdata table 244, respectively. In some embodiments, this may amount tocompletely importing and storing first and second data tables 208, 214in proxy server 206. Fields of the data tables may be associated withidentifying information, such as a key and data-type identifier.

Raw device table 240 and raw server table 244 are utilized by the proxyserver, at Block 304, to generate normalized device data table 248 and anormalized server data table 252, respectively. Fields 250 a-n, 254 a-nin the normalized tables 248, 252 correspond to mutually similar fieldsof the raw device table 208 and raw server table 214, perhaps determinedby the proxy server using key and data-type identifiers. Data entries infields 210 a-n and 216 a-n of raw data tables 208, 214 may be normalizedor placed in a common format by the proxy server, with the normalizedentries used to populate the normalized tables 248, 252. Producing dataentries in a common format may, in some cases, involve several steps.For example, referring to data entries 112 a and 116 a-c of FIG. 2, nameinformation in fields 116 a-c can be aggregated into a single name entrysimilar to that in field 112 a. Then, both the name data in field 112 aand the aggregated name data from fields 116 a-c can be reformatted,say, to be formed entirely of lower case lettering. In other cases,normalization may require fewer steps. For example, in order tonormalize the telephone number data entries of fields 112 b and 116 d,the telephone number data could be formed entirely of numerals (removingparentheses and hyphens). Each normalized data entry could be associatedwith a key, such as a numerical identifier or the entry in the namefield.

The proxy server 206 can include an intermediate data table 256 havingintermediate data table fields, such as “last synchronization” fields258 a-n and “new synchronization” fields 268 a-n, each set of fieldscorresponding to the mutually similar fields of the raw device and rawserver data tables 240, 244 (i.e., each set of the last synchronizationfields and new synchronization fields includes all of the same fields asfound in the normalized data tables 248, 252). At Block 306, dataentries from fields 250 a-n and 254 a-n of the normalized device andserver data tables 248, 252 are respectively compared by the proxyserver to corresponding data entries in last synchronization fields 258a-n. For each case where a data entry in normalized device table 248 isinconsistent with, e.g., does not equal, the corresponding data entry inthe corresponding field of the last synchronization fields 258 a-n, theproxy server stores the data entry in the normalized device table 248,at Block 308, to a delta device data table 262 (also maintained by theproxy server) along with a modification instruction, such as anindicator as to whether the corresponding data entry of the normalizeddevice table 248 has been added (i.e., exists in normalized data table248 but not in last synchronization fields 258 a-n), deleted (i.e.,exists in last synchronization fields 258 a-n but not in normalized datatable 248), or updated (i.e., exists in both normalized data table 248and last synchronization fields 258 a-n, but is dissimilar between thetwo) since the preceding synchronization (the preceding synchronizationbeing represented by last synchronization fields 258 a-n, as discussedbelow). The same comparison is made by the proxy server between the dataentries in normalized server table 252 and the corresponding dataentries in the corresponding fields of the last synchronization fields258 a-n, and inconsistent data entries in the normalized server table252 are stored to a delta server data table 266 (also maintained by theproxy server) along with an indicator as to whether each data entry hasbeen added, deleted, or updated since the last synchronization. Thedelta tables 262, 266 therefore serve as indications of dissimilaritybetween the normalized data tables 248, 252 and the last synchronizationfields 258 a-n.

At block 310, entries of the delta tables 262, 266 may be utilized withthe last synchronization fields 258 a-n to produce the newsynchronization fields 268 a-n. For example, entries in lastsynchronization fields 258 a-n may be copied to new synchronizationfields 268 a-n, and then the entries in new synchronization fields 268a-n may be modified as specified in delta device and server tables 262,266. New synchronization fields 268 a-n therefore represent an updatedset of data entries of the normalized device and server tables 248, 252.A new raw synchronization table 270 containing non-normalized (that is,in a format as received by the proxy server rather than the standardizedformat imposed by the proxy server; original or raw format in someembodiments) counterpart data entries to those of the newsynchronization fields 268 a-n can also be generated by the proxyserver, either subsequent to or in parallel with generation of the newsynchronization fields 268 a-n.

At Block 312, normalized device data table 248 and normalized serverdata table 252 are each compared by the proxy server to thecorresponding fields of the new synchronization fields 268 a-n toproduce second delta device data table 272 and second delta server datatable 274, respectively. For example, this process may proceed in afashion similar to that described in conjunction with Block 308. Each ofsecond delta device data table 272 and second delta server data table274 is indicative of changes to be made to the normalized device andserver data tables 248, 252, respectively, and ultimately to first andsecond data tables 208, 214 of cellular telephone 202 and server 204,respectively, in order to synchronize the two.

As illustrated above, second delta device data table 272 and seconddelta server data table 274 serve as indicators of dissimilarity betweennormalized device table 248 and normalized server table 252 before thesynchronization process. It is also noted that the system has determineda priority between corresponding, but mutually inconsistent, dataentries of the normalized device and server data tables 248, 252, inthat the respective entry that differs from that contained in thecorresponding field of the new synchronization fields 268 a-n is the onethat is assumed to have priority and dictate modification of the other.In cases where corresponding entries of normalized device and serverdata tables 248, 252 are both inconsistent with the corresponding dataentry in the new synchronization fields 268 a-n, priority can bedetermined in other predefined manners. For example, a time of mostrecent editing can be related to each of the data entries and the laterone in time may take priority, or a convention can be utilized, such asa convention in which the entry from the server always takes priorityover the data entry initiated in the cellular telephone.

At Block 314, for each data record in second delta device data table272, the corresponding normalized data entry of the normalized devicedata table 248 is used to create one or more equivalent data entries innon-normalized format (e.g., by retrieving such entries from new rawsynchronization table 270) that are stored to raw delta device datatable 276. Also, for each data record in second delta server data table274, the corresponding normalized data entry of the normalized serverdata table 252 is used to create one or more equivalent data entries innon-normalized format that are stored to raw delta server data table278. Once the raw delta tables 276, 278 are formed, these can betransmitted, at Block 316, to the cellular telephone 202 and server 204,respectively, in order to allow updating and synchronizing of the firstand second data tables 208, 214.

At Block 318, the data entries stored in new synchronization fields 268a-n may be saved to last synchronization fields 258 a-n. Lastsynchronization fields 258 a-n therefore serve as a record of the statusof the device and server data tables at the time of the last completedsynchronization process. Last synchronization fields 258 a-n are thenready for utilization in subsequent synchronization processes. A lastraw synchronization table 280 can also be stored in the proxy server,the entries in last raw synchronization table 280 being thenon-normalized equivalents to the entries saved to the lastsynchronization fields 258 a-n.

In some embodiments, the proxy server will track each subsequentsynchronization event, for example, by generating a unique keyassociated with a synchronization event, or by incrementing a counterfollowing each synchronization event. The key/counter may be maintained,for example, in the proxy server, as well as in the devices for whichdata tables are being synchronized. In one embodiment, the key/counterstored in each device involved in a synchronization event will becompared to the key/counter stored in other devices involved in thesynchronization event. For example, the proxy server may, when updatingnew synchronization fields 268 a-n from one or both of the delta serverand/or device tables 262, 266, compare the key/counter between twodevices undergoing synchronization. If the key/counter is not the samefor each device, the proxy server may ignore any instructions stored inthe delta server and/or device tables 262, 266 indicating deletion ofentries. The proxy server may be configured to ignore deletioninstructions for one specific device in the pair being synchronized orfor both devices being synchronized.

It should be noted that functions of the above process may bedistributed in a variety of ways between the cellular telephone, theserver, and the proxy server. For example, while the process has beendescribed as involving receipt by the proxy server of data tables from acellular telephone and server, and subsequent normalization of thosedata tables at the proxy server, it is also possible for the cellulartelephone and/or server to normalize the data tables locally andtransmit the normalized tables to the proxy server for furtherprocessing. This interchangeability of equipment carrying out the abovefunctions is present for most, if not all, steps of the above describedprocess.

FIG. 6 is a flowchart of a method according to an exemplary embodiment,and this flowchart is also representative of a system and programproduct according to exemplary embodiments. It will be understood thateach block or step of the flowchart, and combinations of blocks in theflowchart, can be implemented by various means, such as hardware,firmware, and/or software including one or more computer programinstructions. In this regard, the computer program instructions whichembody the procedures described above may be stored by a memory deviceof a computing device, such as the control server or the portals, andexecuted by a built-in processor of the computing device. As will beappreciated, any such computer program instructions may be loaded onto acomputer or other programmable apparatus (i.e., hardware) to produce amachine, such that the instructions which execute on the computer orother programmable apparatus create means for implementing the functionsspecified in the flowcharts block(s) or step(s). These computer programinstructions may also be stored in a computer-readable memory that candirect a computer or other programmable apparatus to function in aparticular manner, such that the instructions stored in thecomputer-readable memory produce an article of manufacture includinginstruction means which implement the function specified in theflowcharts block(s) or step(s). The computer program instructions mayalso be loaded onto a computer or other programmable apparatus to causea series of operational steps to be performed on the computer or otherprogrammable apparatus to produce a computer-implemented process suchthat the instructions which execute on the computer or otherprogrammable apparatus provide steps for implementing the functionsspecified in the flowcharts block(s) or step(s).

Accordingly, blocks or steps of the flowcharts support combinations ofmeans for performing the specified functions, combinations of steps forperforming the specified functions and program instruction means forperforming the specified functions. It will also be understood that oneor more blocks or steps of the flowcharts, and combinations of blocks orsteps in the flowcharts, can be implemented by special purposehardware-based computer systems which perform the specified functions orsteps, or combinations of special purpose hardware and computerinstructions.

In the preceding specification, various embodiments of the claimedinvention have been described. It will, however, be evident that variousmodifications and changes may be made thereunto without departing fromthe broader spirit and scope of the invention as set forth in the claimsthat follow. For example, many of the above examples have referred tomultiple tables stored in a memory or other storage device. However, itshould be understood that some, if not all, of the tables discussedmight be integrated into a single table or a smaller number of tables,with each such table including fields corresponding to what was formerlytermed a “table.” Also, many of the above examples have focused onsynchronization between two data tables stored in two independentdevices, and data tables stored in a cellular telephone and a server,However, the present invention is not so limited, but can be applied insynchronizing any number of data tables contained in any number ofseparate devices. The specification and drawings are accordingly to beregarded in an illustrative rather than restrictive sense.

1. An apparatus comprising: a storage device for storing first andsecond data tables, each table having one or more fields of which atleast some are populated with data entries, both data tables having oneor more mutually similar fields and at least one of the data tableshaving one or more dissimilar fields that are dissimilar to fields ofthe other data table; and a processing device in communication with saidstorage device, said processing device being configured to modify thefirst and second data tables so as to synchronize the similar fields ofthe first and second data tables.
 2. An apparatus according to claim 1,further comprising a communications device configured to receive from atleast one separate device, and to communicate to said storage device,the first and second data tables.
 3. An apparatus according to claim 2,wherein the communications device is further configured to communicateat least one of the first and second data tables, subsequent tomodification, to at least one separate device.
 4. An apparatus accordingto claim 2, wherein said communications device is further configured toreceive from the at least one separate device, and communicate to saidstorage device, at least one updated data table that replaces at least arespective one of the first and second data tables previously stored insaid storage device.
 5. An apparatus according to claim 2, wherein saidcommunications device is configured to communicate with at least oneseparate device that is selected from the group consisting of a mobilephone and a mobile data device.
 6. An apparatus according to claim 1,wherein said processing device is further configured to determine apriority between respective differing data entries from similar fieldsof the first and second data tables and to modify both of the first andsecond data tables so as to synchronize the similar fields of the firstand second data tables by replacing each differing data entry of lowerpriority with the corresponding differing data entry of higher priority.7. An apparatus according to claim 6, wherein said storage deviceincludes an intermediate data table having intermediate data tablefields corresponding to the mutually similar fields of the first andsecond data tables, at least some of the intermediate data table fieldsbeing populated with previously stored intermediate data table entries,and wherein said processing device is configured to determine a priorityand generate a dissimilarity indicator by using data entries of thefirst and second data tables to respectively replace correspondinginconsistent intermediate data table entries and subsequently usingintermediate data table entries to respectively replace correspondinginconsistent first and second data table entries.
 8. An apparatusaccording to claim 6, wherein said storage device includes anintermediate data table having intermediate data table fieldscorresponding to the mutually similar fields of the first and seconddata tables, at least some of the intermediate data table fields beingpopulated with previously stored intermediate data table entries, andwherein said storage device includes first and second normalized datatables with respective first and second normalized data table fieldscorresponding to the mutually similar fields of the first and seconddata tables, and said processing device is further configured to atleast partially populate the first normalized data table withcorresponding entries from the first data table and the secondnormalized data table with corresponding entries in the second datatable, and wherein said processing unit is configured to determine apriority and generate a dissimilarity indicator by using data entries ofthe first and second normalized data tables to respectively replacecorresponding inconsistent intermediate data table entries andsubsequently using intermediate data table entries to respectivelyreplace corresponding inconsistent first and second normalized datatable entries.
 9. An apparatus according to claim 8, wherein the entriesin the first and second normalized data tables and in the intermediatedata table are in a common data format, and wherein the processingdevice is further configured to use the entries in the first and secondnormalized data tables in common data format to generate data inalternative formats compatible with an external device. 10-20.(canceled)